Gaussian Process Regression: Active Data Selection and Test Point Rejection
نویسندگان
چکیده
We consider active data selection and test point rejection strategies for Gaussian process regression based on the variance of the posterior over target values. Gaussian process regression is viewed as transductive regression that provides target distributions for given points rather than selecting an explicit regression function. Since not only the posterior mean but also the posterior variance are easily calculated we use this additional information to two ends: Active data selection is performed by either querying at points of high estimated posterior variance or at points that minimize the estimated posterior variance averaged over the input distribution of interest or | in a transductive manner | averaged over the test set. Test point rejection is performed using the estimated posterior variance as a conndence measure. We nd for both a two-dimensional toy problem and for a real-world benchmark problem that the variance is a reasonable criterion for both active data selection and test point rejection.
منابع مشابه
Transductive Gaussian Process Regression with Automatic Model Selection
In contrast to the standard inductive inference setting of predictive machine learning, in real world learning problems often the test instances are already available at training time. Transductive inference tries to improve the predictive accuracy of learning algorithms by making use of the information contained in these test instances. Although this description of transductive inference appli...
متن کاملA KNN Based Kalman Filter Gaussian Process Regression
The standard Gaussian process (GP) regression is often intractable when a data set is large or spatially nonstationary. In this paper, we address these challenging data properties by designing a novel K nearest neighbor based Kalman filter Gaussian process (KNN-KFGP) regression. Based on a state space model established by the KNN driven data grouping, our KNN-KFGP recursively filters out the la...
متن کاملApproximation of Gaussian process regression models after training
The evaluation of a standard Gaussian process regression model takes time linear in the number of training data points. In this paper, the models are approximated in the feature space after training. It is empirically shown that the time required for evaluation can be drastically reduced without considerable loss in performance.
متن کاملGaussian Process Models for HRTF based Sound-Source Localization and Active-Learning
From a machine learning perspective, the human ability localize sounds can be modeled as a non-parametric and non-linear regression problem between binaural spectral features of sound received at the ears (input) and their sound-source directions (output). The input features can be summarized in terms of the individual’s head-related transfer functions (HRTFs) which measure the spectral respons...
متن کاملKNN-based Kalman filter: An efficient and non-stationary method for Gaussian process regression
The traditional Gaussian process (GP) regression is often deteriorated when the data set is large-scale and/or non-stationary. To address these challenging data properties, we propose a K-Nearest-Neighbor-based Kalman filter for Gaussian process regression (KNN-KFGP). Firstly, we design a test-inputdriven KNN mechanism to group the training set into a number of small collections. Secondly, we u...
متن کامل